Nvidia reveals H100 GPU for AI and teases ‘world’s fastest AI supercomputer’
All built on the company’s new Hopper architecture

Source: The Verge, by James Vincent.

Nvidia has announced a slew of AI-focused enterprise products at its annual GTC conference. They include details of its new silicon architecture, Hopper; the first datacenter GPU built using that architecture, the H100; a new Grace CPU “superchip”; and vague plans to build what the company claims will be the world’s fastest AI supercomputer, named Eos.

Nvidia has benefited hugely from the AI boom of the last decade, with its GPUs proving a perfect match for popular, data-intensive deep learning methods. As the AI sector’s demand for compute grows, Nvidia says it wants to provide more firepower.

In particular, the company stressed the popularity of a type of machine learning system known as a Transformer. This method has been incredibly fruitful, powering everything from language models like OpenAI’s GPT-3 to medical systems like DeepMind’s AlphaFold. Such models have increased exponentially in size over the space of a few years. When OpenAI launched GPT-2 in 2019, for example, it contained 1.5 billion parameters (or connections). When Google trained a similar model just two years later, it used 1.6 trillion parameters.

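For a sense of what those parameter counts mean in hardware terms, here is a rough back-of-the-envelope calculation (ours, not from the article), assuming two bytes per parameter as in FP16:

```python
# Memory footprint of the model weights alone, assuming 2 bytes per
# parameter (FP16). Optimizer state and activations add much more.
BYTES_PER_PARAM = 2

for name, params in [("GPT-2 (2019)", 1.5e9), ("Google's 2021 model", 1.6e12)]:
    gigabytes = params * BYTES_PER_PARAM / 1e9
    print(f"{name}: {params:.1e} parameters ≈ {gigabytes:,.0f} GB of weights")
```

At the larger scale, the weights alone run to terabytes and no longer fit in any single GPU’s memory, which is why training has to be spread across many accelerators in the first place.
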
“Training these giant models still takes months,” said Nvidia senior director of product management Paresh Kharya in a press briefing. “So you fire a job and wait for one and a half months to see what happens. A key challenge to reducing this time to train is that performance gains start to decline as you increase the number of GPUs in a data center.”

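Kharya’s point about diminishing returns can be illustrated with a toy strong-scaling model (our sketch with invented numbers, not Nvidia’s analysis): splitting each training step across n GPUs divides the compute time by n but adds a communication cost that grows with the GPU count.

```python
# Toy strong-scaling model: step_time(n) = compute / n + comm(n).
# All numbers are invented for illustration; real overheads depend on
# interconnect bandwidth, topology, and the collective algorithm used.
import math

COMPUTE = 1.0        # per-step compute time on one GPU (arbitrary units)
COMM_PER_HOP = 0.01  # assumed communication cost per doubling of GPUs

def step_time(n_gpus: int) -> float:
    comm = COMM_PER_HOP * math.log2(n_gpus) if n_gpus > 1 else 0.0
    return COMPUTE / n_gpus + comm

for n in [1, 8, 64, 256, 1024]:
    speedup = step_time(1) / step_time(n)
    print(f"{n:5d} GPUs: speedup {speedup:6.1f}x, efficiency {speedup / n:6.1%}")
```

Past some point the communication term dominates and adding GPUs barely helps; that term is exactly what faster interconnects are meant to shrink.
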
Nvidia says its new Hopper architecture will help ameliorate these difficulties. Named after pioneering computer scientist and US Navy Rear Admiral Grace Hopper, the architecture is specialized to accelerate the training of Transformer models on H100 GPUs by six times compared with previous-generation chips, while the new fourth-generation Nvidia NVLink can connect up to 256 H100 GPUs at nine times the bandwidth of the previous generation.

The H100 GPU itself contains 80 billion transistors and is the first GPU to support PCIe Gen5 and utilize HBM3, enabling memory bandwidth of 3TB/s. Nvidia says an H100 GPU is three times faster than its previous-generation A100 at FP16, FP32, and FP64 compute, and six times faster at 8-bit floating-point (FP8) math.

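Those precision-specific speedups matter because mainstream frameworks already run much of training at reduced precision. Here is a minimal mixed-precision training step in stock PyTorch, shown purely for illustration (FP8, the format behind the six-times figure, is not covered by this standard API and is instead exposed through vendor libraries such as Nvidia’s Transformer Engine):

```python
# Minimal mixed-precision training step in PyTorch: autocast runs
# eligible ops in FP16 while master weights stay FP32; GradScaler
# guards against FP16 gradient underflow.
import torch

model = torch.nn.Linear(1024, 1024).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
scaler = torch.cuda.amp.GradScaler()

x = torch.randn(32, 1024, device="cuda")
target = torch.randn(32, 1024, device="cuda")

optimizer.zero_grad()
with torch.autocast(device_type="cuda", dtype=torch.float16):
    loss = torch.nn.functional.mse_loss(model(x), target)

scaler.scale(loss).backward()
scaler.step(optimizer)
scaler.update()
```

The pattern is the same regardless of format: do the bulky matrix math in the cheap precision, and keep master weights and loss scaling in higher precision to preserve accuracy.
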
“For the training of giant Transformer models, H100 will offer up to nine times higher performance, training in days what used to take weeks,” said Kharya.

The company also announced a new data center CPU, the Grace CPU Superchip, which consists of two CPUs connected directly via a new low-latency NVLink-C2C. The chip is designed to “serve giant-scale HPC and AI applications” alongside the new Hopper-based GPUs, and can be used for CPU-only systems or GPU-accelerated servers. It has 144 Arm cores and 1TB/s of memory bandwidth.

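One way to read the 1TB/s figure is through a simple roofline-style ceiling (our arithmetic, not Nvidia’s): memory bandwidth times a kernel’s arithmetic intensity bounds its achievable throughput, no matter how many of the 144 cores are in play.

```python
# Roofline-style bound: a kernel performing flop_per_byte floating-point
# operations per byte of memory traffic cannot exceed bandwidth * intensity.
MEM_BANDWIDTH_BPS = 1e12  # 1 TB/s, as quoted for the Grace CPU Superchip

for flop_per_byte in [0.25, 1, 4, 16]:
    ceiling_tflops = MEM_BANDWIDTH_BPS * flop_per_byte / 1e12
    print(f"{flop_per_byte:5.2f} FLOP/byte -> ceiling {ceiling_tflops:6.2f} TFLOP/s")
```

Data-heavy, low-intensity stages of AI pipelines are exactly where that kind of bandwidth, rather than raw core count, sets the pace.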
